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Abstract 

Background: Histone post-translational modifications (PTMs) play an important role in the regulation of the 
expression of genes, including those involved in cancer development and progression. However, our knowledge 
of PTM patterns in human tumours is limited. 

Methods: MS-based analyses were used to quantify global alterations of histone PTMs in colorectal cancer (CRC) 
samples. Histones isolated from 12 CRCs and their corresponding normal mucosa by acidic extraction were 
separated by SDS-PAGE and analysed by liquid chromatography-mass spectrometry. 

Results: Among 96 modified peptides, 41 distinct PTM sites were identified, of which 7, 13, 11, and 10 were 
located within the H2A, H2B, H3, and H4 sequences, respectively, and distributed among the amino-terminal tails 
and the globular domain of the four histones. Modification intensities were quantified for 33 sites, of which 4 
showed significant (p-value < 0.05) differences between CRC tissues and healthy mucosa samples. We identified histone 
H3 lysine 27 acetylation (H3K27Ac) as a modification upregulated in CRC, which had not been shown previously. 

Conclusions: The present results indicate the usefulness of a bottom-up proteomic approach for the detection of 
histone modifications at a global scale. The differential abundance of H3K27Ac mark in CRC, a PTM associated with 
active enhancers, suggests its role in regulating genes whose expression changes in CRC. 
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Background 

Colorectal cancer (CRC) is the most common cancer in 
the Polish population, and the leading cause of cancer- 
related morbidity and mortality [1]. Most CRCs are 
sporadic, and only a small proportion is associated with 
hereditary disorders with high penetration, such as 
Lynch syndrome, familial adenomatous polyposis and 
other polyposis syndromes mediated by rare germline 
mutations in DNA mismatch-repair genes and in the 
adenomatous polyposis coli (APC) gene [2]. 

Cancer is a multi-step process involving successive 
clonal selection events. The growth advantage of dysplas- 
tic cells over their normal neighbours leads to progressive 
cytological and architectural derangement, and individual 
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cancer phenotypes are the result of cell-specific, develop- 
mental stage-specific, and metabolism-related changes in 
gene expression that occur selectively at specific times 
and are modified by epigenetic interactions [3]. Epigenetic 
changes such as DNA and histone modifications, chroma- 
tin remodelling and regulation by noncoding RNAs can 
result in massive deregulation of gene expression during 
the course of cancer development [4], The global effects 
of altered epigenetic patterns in gene regulatory sequences 
have been determined by the ENCODE project [5]. 

Histone post-translational modifications (PTMs) in- 
clude lysine acetylation, arginine and lysine methylation, 
phosphorylation, proline isomerization, ubiquitination 
(Ub), ADP ribosylation, arginine citrullination, SUMOy- 
lation, carbonylation and biotinylation [6]. The most 
common PTMs are acetylation and methylation [7]. 
Within the five main histone proteins, PTMs can occur 
at multiple positions, although they are most frequent at 
histone N-terminal tails [8]. Despite the key role of 
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epigenetic alterations in cancer development, little is 
known about the patterns of histone PTM alterations in 
human tumours [9]. 

Proteomic methods are largely based on the use of 
mass spectrometry (MS), a highly specific, effective, and 
universal technique that does not require complicated 
multi-step sample preparation. One of the most promin- 
ent features of MS is its sensitivity, which enables the 
detection of attomolar sample concentrations with an 
error of 0.01% of the total sample mass. Proteomics ana- 
lyses the composition, amounts, isoforms, and post- 
translational modifications of cellular proteins [10]. 

In the present study, we used MS -based analysis to 
quantify global alterations of histone PTMs in matched 



normal and colon cancer samples. Our results showed 
that histone H3 lysine 27 acetylation (H3K27Ac) is asso- 
ciated with colon cancer. 

Results 

Histones were isolated from 12 CRC tissues and corre- 
sponding normal mucosa, and equal protein amounts 
were separated by SDS-PAGE and subjected to qualita- 
tive LC-MS/MS and quantitative label-free LC-MS 
analyses (Figure 1). 

Qualitative histone protein analyses 

For protein identification 10 pooled samples were ana- 
lysed by LC-MS/MS for protein identification, resulting 
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Figure 1 Diagram of worlcflow to determine altered histones PTMs in colon carcinoma tissues. Histones were isolated from wliole tissue 
sections by acidic extraction using Sliecliter et al. protocol [36] followed by separation using SDS-PAGE and silver staining. The gel part containing 
histone core proteins were excised next proteins were digested with trypsin and subjected to MS analyses. MS/MS runs of pooled samples were 
performed to identify in deep the peptides that compose the collection of samples. The custom peptide database was further overlaid on individual 
2D maps acquired in LC-MS runs. Maps were then used as the basis to quantify and point modified peptides with differential abundance between 
CRC and normal mucosa samples. 
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in the acquisition of 386120 fragmentation spectra. A 
search against the SwissProt database using the Mascot 
engine confidently identified a set of 2,647 peptides with 
an estimated false discovery rate of 0.01 (Additional file 1: 
Table SI). In total, 522 proteins were identified, of which 
357 were represented by at least two peptides (Additional 
file 1: Table S2). Among the detected peptides, 285 origi- 
nated from core histone proteins, including H4, H3.1, 
H3.2, H3.3 and numerous variants of H2A and H2B. 
However, the unambiguous identification of the members 
of the two latter families was difficult. High sequence 
homology between these two families leads to the detec- 
tion of multiple shared peptides, which can be attributed 
to more than a single protein. As a result, it is not always 
possible to identify the particular proteins present in the 
samples. In the present study, the indistinguishable his- 
tone H2A and H2B subtypes were grouped into six and 
eight distinct clusters represented by variants accounting 
for all observed peptides. The final results of core histone 
protein identification are summarized in Table 1, along 
with the number of detected peptides and sequence 
coverage. A more detailed description of their peptide- 
protein dependencies is also available in Additional file 1: 
Table S3. 

Table 1 LC-MS/MS histone protein identification in gel slices 



SwissProt ACC Name Coverage [%] 

P0C0S8 Histone H2A type 1 65.89 

Q96KK5 Histone H2A type 1 -H 66.93 

Q96QV6 Histone H2A type 1-A 66.93 

Q93077 Histone H2A type 1-C 65.89 

Q6FI13 Histone H2A type 2-A 67.44 

Q8IUE6 Histone H2A type 2-B 50.39 

Q71UI9 Histone H2A.V 50.39 

P0C0S5 Histone H2A.Z 50.39 

PI 61 04 Histone H2A.X 65.49 

P62807 Histone H2B type 1-C/E/F/G/l 86.40 

P58876 Histone H2B type 1-D 86.40 

Q93079 Histone H2B type 1-H 86.40 

P06899 Histone H2B type 1-J 86.40 

Q99880 Histone H2B type 1-L 80.80 

P23527 Histone H2B type 1-0 86.40 

Ql 6778 Histone H2B type 2-E 86.40 

P57053 Histone H2B type F-S 86.40 

060814 Histone H2B type 1-K 86.40 

P68431 Histone H3.1 80.00 

Q71DI3 Histone H3.2 80.00 

P84243 Histone H3.3 80.00 

P62805 Histone H4 80.39 



The identification of 96 modified peptides allowed for 
the characterisation of 41 distinct post-translational modi- 
fication sites, of which 7, 13, 11, and 10 were located 
within the sequences of H2A, H2B, H3, and H4, respect- 
ively (Additional file 1: Table S4 and Figure 2). Multiple 
modification variants were detected on 6 sites (14.6%), 
with a maximum of four different modifications per site. 
As shown in Figure 2, the sites were distributed in the 
amino-terminal tails and the globular domain forming the 
nucleosomal core of each of the four histones. Despite the 
sequence divergence, the observed modification patterns 
were generally preserved within the H2A, H2B and H3 
families, with the exception of H2A.V and H2A.Z variants. 

Acetylation, which was observed in 36 lysine residues, 
was the most prominent of all the studied modifications. 
In 31 instances, it was the only modification detected 
on a given site, whereas the remaining 5 sites showed 
alternative modification variants, including mono-, di- 
and tri-methylation (three, four, and four sites, respec- 
tively). Modifications on arginine residues were signifi- 
cantly less frequent and only a single site of methylation 
and deamidation was detected in the sequence of the 
histone H2A. Two serine and two threonine phospho- 
rylation sites were also identified. 



Peptides 


Modified peptides 


Modification 


42 


10 


5 


42 


10 


5 


38 


6 


4 


48 


10 


5 


43 


8 


4 


13 


2 


1 


13 


2 


1 


13 


2 


1 


43 


9 


5 


51 


19 


13 


50 


18 


13 


51 


19 


13 


50 


19 


13 


49 


18 


13 


50 


19 


13 


50 


19 


13 


51 


19 


13 


51 


19 


13 


61 


28 


10 


59 


30 


10 


55 


27 


11 


42 


12 


11 



Proteins matching the same sets of peptides are Indicated by bolded text. Non-hlstone proteins are not listed. 
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Figure 2 Amino acid sequences of core histone and their forms, indicating sites of post-translational modification. The grey block marks 
the N-terminal histone tail. The alignment of protein sequences and visualization of the detected modification sites were performed using CLC 
Sequence Viewer (CLC Bio). 



A crosscheck with the PhosphoSitePlus [11] and His- 
tome [12] databases revealed that although most of the 
sites had been previously reported, in several cases our 
survey provided a more comprehensive list of their pos- 
sible modification. For example, lysine K43 of histone 
H2B, which is a known ubiquitination site, was shown to 
have an acetylated variant. A detailed summary of site- 
modification combinations not covered by the two data- 
bases is presented in Additional file 1: Table S4. 



Quantitative analysis 

Peptides identified by LC-MS/MS analyses were quanti- 
fied in individual samples (12 CRC tissue-healthy mu- 
cosa pairs) using a label-free approach and modification 
intensities were calculated for each of the detected sites 
(see Methods). We obtained 45 reliable quantitative es- 
timates of modification intensities on 33 sites. Using a 
threshold of p< 0.05, 4 sites exhibiting significant diffe- 
rences in modification intensity between the two sample 
groups were detected (Table 2), of which three were up- 
regulated and one downregulated in cancerous samples. 
Among them we identified histone H3 lysine 27 acetyl- 
ation (H3K27Ac) as a modification upregulated in CRC; 



an example of fragmentation spectrum for H3K27Ac is 
presented in Additional file 2: Figure SI. 

Availability of antibodies allowed further evaluation of 
the two selected PTMs of histone H3, namely H3K27Ac 
and K27 trimethylation (me3). Western blotting-based 
analysis of histones isolated from 12 CRC tissue samples 
and paired healthy mucosa samples, followed by densito- 
metric analyses, confirmed increased K27 acetylation at 
H3 (fold change (FC) = 1.31, p-vslue = 0.0093) (Figure 3), 
whereas no significant differences were observed for the 
H3K27me3 marker (not shown). 

MS and western blot results of H3K27Ac alteration 
were also confirmed by the immunohistochemical staining 

Table 2 List of histone modifications showing differences 
in abundance (p-value < 0.05) between cancerous (CRC) 
and normal (NC) colonic mucosa 



Histone 


Site 


Modification 


p-value 


FC CRC/NC 


H3 


K27 


Trimethyl (K) 


0.004032 


1.67 


H3 


K27 


Acetyl (K) 


0.012378 


1.54 


H2B 


S64 


Phospho (ST) 


0.016537 


1.24 


H3 


K79 


Acetyl (K) 


0.044106 


0.83 



FC, fold change. 
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Figure 3 H3K27Ac is upregulated in colon carcinoma tissues. (A) Results of H3 and H3K27Ac immunostaining in 12 pairs of CRC tissues and 
normal mucosa and graphical depiction of densitometric assessment of differences (B). Histones were isolated from whole tissue sections by 
acidic extraction and equal amounts of protein were separated by SDS-PAGE, electrotransferred to PVDF membranes and immunostained with 
H3 (abl791) or H3K27Ac (ab4729) antibody. Densitometric measurements were performed using OptiQuant image analysis software. H3K27Ac 
level was normalised to the signal from total H3. C, CRC- colon cancer; N, NC- normal colonic mucosa. 



of 10 pairs of normal and CRC formalin-fixed paraffin- 
embedded tissue samples; five tissue pairs were com- 
mon with MS and western blot analyses (Additional file 1: 
Table S5). While both normal and CRC tissue reveal- 
ed pronounced positive nuclear immunoreactivity for 
H3K27AC (Figure 4 and Additional file 2: Figure S2), the 
percentage of positively stained nuclei (labelling index), 
calculated with the use of automated image analysis 
software in the representative pictures of matched nor- 
mal/CRC sample sections, revealed higher ratio of immu- 
nopositive cells compared to normal counterpart (69.01% 
vs. 52.66% respectively, p = 0.0052) - Table 3. 

To determine whether increase in H3K27Ac mark is 
associated with proliferating cells, we measured its abun- 
dance in four resting and dividing CRC cell lines, namely 



HCT-116, Colo205, HT29 and Caco2, using the west- 
ern blot. While there were no differences in H3K27Ac 
levels between quiescent and proliferating cells, variable 
levels of that mark were found between the cell lines 
(Additional file 2: Figure S3). 

Next, we wished to determine the expression of en- 
zymes controlling H3K27Ac mark. To this end, using 
quantitative (q) RT-PCR, we measured the mRNA levels 
of enzymes controlling H3K27Ac mark abundance, namely 
CBP/p300 acetyltransferases [13,14] and HDACl deacety- 
lase [15], on 26 CRC and 24 healthy mucosa samples col- 
lected in our previous study [16]. These measurements 
revealed significant downregulation of CBP, p300 and 
HDACl transcripts in CRC (Additional file 2: Figure S4) 
with a FC of 0.37, 0.34 and 0.7, respectively. 



x100 magnification x400 magnification 




Figure 4 An example of an immunohistochemical analysis of representative matched normal colonic and CRC sections with use of 
antibody against H3K27Ac. 
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Table 3 H3K27Ac labelling Index in matched CRC and 
corresponding normal tissue sections 



Sample ID* CRC tissue Normal colonic section 



11028 


60.9 


67.1 


14585 


59.5 


55.6 


14600 


55.2 


54.9 


14616 


75.1 


46.5 


14882 


75.7 


39.2 


14935 


62.2 


43 


15244 


78.5 


49.3 


17022 


79.9 


53.9 


10991 


89 


62.4 


15732 


54.1 


54.7 



#details on the samples can be found in Additional file 1: Table S5. 



Discussion 

Histone PTMs can affect DNA-histone interactions or 
inter-nucleosomal contacts, as well as the recruitment 
of non-histone proteins to chromatin via bromo, 
chromo and PHD domains [17,18]. Thus, PTMs are re- 
sponsible for the regulation of chromatin structure and 
function, constituting the epigenetic code. Although 
challenging, traditional MS -based bottom-up analyses 
allow the identification and characterisation of PTMs 
without prior knowledge of the modification site or type 
(reviewed in [6]). 

In the present study, we used a bottom-up proteomic 
approach to investigate alterations in histone modifica- 
tions in colon cancer samples and their normal counter- 
parts. We identified 96 modified histone derived peptides, 
of which 45 site-modification combinations were further 
quantified, revealing 4 sites with differential abundance 
between cancerous and normal mucosa. For further vali- 
dation, we chose histone H3 lysine 27 acetylation because 
this modification has not been previously shown to be 
altered in CRC [19]. 

Protein immunostaining on western blots confirmed 
the increased K27 acetylation at H3 (Figure 3) and im- 
munohistochemical staining of paired CRC and corre- 
sponding normal tissue sections revealed nuclear 
localisation of acetylated H3 protein (Figure 4). Since dif- 
ferent morphology of cells of epithelial type is observed in 
two types of tissue, the percentage of immunoractive cells 
(labelling index) was calculated in the representative pic- 
tures of each sample and used for the comparison of the 
two types of colorectal tissue. We found significantly 
higher H3K27Ac index in CRC samples compared to nor- 
mal tissue which correspond the results obtained with 
both of MS and western blot. 

Acetylation of lysine residues is a major histone modi- 
fication involved in the regulation of chromatin struc- 
ture and transcription. It neutralises the positive charge 



on the lysine side chain, relaxing the chromatin struc- 
ture, and it generates docking sites for bromodomain- 
containing proteins [20]. The balance between the en- 
zymatic activities of histone lysine acetyltransferases and 
deacetylases regulates the level of histone acetylation. 
Furthermore, the global level of histone acetylation de- 
pends on intracellular acetyl-CoA pools [21]. 

Acetyltransferases consist of three families, GNAT, 
MYST, and CBP/p300, which generally act promiscu- 
ously on more than one lysine; however, some specificity 
has been observed for these enzymes [20]. Aberrant 
acetylation of histones has been linked to CRC patho- 
genesis (reviewed in [19]). Studies suggest that histone 
acetylation is reduced in CRC and in other tumours 
[22]; however, examination of specific sites shows that 
acetylation can be either up- or downregulated. For ex- 
ample, Fraga et al. used EC-LC-ES/MS and western blot 
analysis and showed a loss of monoacetylation at H4K16 
in CRC cell lines [23]. Global hypoacetylation of H4K12 
and H3K18 has been observed in undifferentiated colo- 
rectal adenocarcinomas, whereas their acetylation was 
increased in well-differentiated tumours [24]. Contrary 
to these findings, the H3K9 hypoacetylation status was 
positively correlated with tumour histological type and 
low H3K9Ac was observed in well-differentiated tu- 
mours [25]. To the best of our knowledge, aberrant 
H3K27Ac levels in CRC have not been reported to date. 
In other solid tumours, namely lung adenocarcinomas 
and squamous cell carcinomas, the increase in H3K27 
acetylation is more pronounced in the tumour compart- 
ment than in the corresponding stroma [26]. 

H3K27Ac was first discovered in yeast [27] and is 
present in animals and plants [28] . Recent advancements 
in DNA sequencing technology have enabled the ana- 
lysis of histone modification distribution patterns across 
the genome. These studies have shown that among his- 
tone acetylation marks [29], H3K27Ac is frequently as- 
sociated with active enhancer regulatory elements [30], 
and genes associated with these enhancers are expressed 
at higher levels than those lacking the H3K27Ac mark 
[31]. The H3K27AC mark is established by CBP/p300 
acetyltransferase [13,14] and is likely erased by RBP3/ 
HDACl [15]. Contrary to the report by Ishihama et al. 
who found CBP/p300 and HDACl mRNAs [32] upregu- 
lated in CRC, our qPCR measurements showed decreased 
levels of these transcripts (Additional file 2: Figure S4). 
The discrepancy could be due to the differences in the 
methodologies of transcripts measurements; while Ishi- 
hama et al. used both semi-quantitative endpoind RT-PCR 
and in situ hybridisation [32], in this study we used qPCR 
which is regarded as a gold standard for the quantification 
of nucleic acids [33]. Although other reports highlighted 
higher abundances of CBP/p300 and HDACl proteins in 
CRC tissue [34,35], it must be stressed that the alterations 
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in the expression of these epigenetic enzymes are not en- 
tirely responsible for specific histone modification levels, 
because other variables such as substrate availability and 
enzyme activity may contribute to the final modification. 

Conclusions 

In conclusion, for the first time, we show that H3K27Ac 
mark is increased in CRC. Further experiments, which 
are beyond the scope of this study, such as chromatin 
immunoprecipitation followed by deep sequencing (ChlP- 
Seq) in colon tissues to examine the distribution of the 
H3K27Ac mark, may identify enhancer regulatory sites 
that differ in mark levels between normal and cancerous 
samples. Such datasets combined with increased know- 
ledge on genome regulatory elements and long genomic 
interactions deposited in the ENCODE database [5] 
could potentially lead to the identification of genes 
whose expression changes are associated with altered 
H3K27AC status. 

Methods 

Tissue samples 

Patients were selected as described previously [4]. The 
study protocol was approved by the Cancer Center 
Bioethics Committee, and all patients signed informed 
consent forms before inclusion. Twelve sporadic CRC 
samples and paired full-thickness normal colon frag- 
ments were obtained by surgical resection through lapar- 
otomy, snap frozen in liquid nitrogen within 10-30 min 
of harvesting and stored at 72°C until use. To select 
samples with a high content of normal and dysplastic 
mucosa, several series of cryostat sections were prepared 
from different parts of each specimen using a Microm 
HM 505E (Zeiss, Germany). Histological evaluation of 
the upper and lower sections from each cryosection col- 
lection revealed a 70% median-relative content of normal 
mucosa (range 40 - 90%) and 50% (20 - 90%) dysplastic 
mucosa in the specimens. Clinical characteristics and 
histopathology of the tissue samples used in a study is pre- 
sented in Additional file 1: Table S5. 

Colon adenocarcinoma cell lines 

HCT-116, Colo205, HT-29 and Caco2 cell lines were 
purchased from ATCC and maintained under standard 
conditions in culture medium (HCT-116 and HT-29 - 
McCoys 5A Medium; Colo205 - RPMI-1650 medium; 
Caco2 - DMEM medium) supplemented with 10% PBS 
(Fetal Bovine Serum). Cells were made quiescent by low- 
ering PBS concentration to 0.5% PBS for 24 h prior to 
harvest. 

Histones extraction 

Histones were isolated from whole tissue sections by 
H2SO4 extraction [36] and equal amounts of sample 



protein (20 \ig) were separated by standard 15% SDS- 
PAGE and silver stained. Gel fragments containing H2A, 
H2B, H3, and H4 bands were cut out separately for each 
electrophoresis line; proteins in the gel were reduced, 
alkylated, and trypsin-digested using standard protocols, 
and the resulting peptides were extracted using 0.1% 
TPA/2% acetonitrile (ACN). 

Western blot and densitometric data analyses 

Por western blotting, 5 [ig of each protein sample was 
resolved by SDS-PAGE and electrotransferred to a PVDP 
membrane. Blotted proteins were detected using the 
following antibodies: H3 (abl791. Abeam), H3K27Ac 
(ab4729. Abeam), H3K27me3 (07-449, Millipore). Pro- 
tein band intensities were assessed by densitometry 
using OptiQuant image analysis software (Packard). Data 
were normalised to the amount of total histone H3 and 
expressed as fold change in CRC tissues versus control 
samples. The statistical significance of histone modifica- 
tions between two tissue types, as detected by western 
blotting, was determined with the nonparametric Wil- 
coxon signed-rank test, and j^-values of < 0.05 were con- 
sidered significant. 

Quantitative RT-PCR 

Measurements of mRNA abundances were performed on 
the 26 CRC and 24 healthy mucosa samples and with a q- 
RT-PCR protocol described previously [16]. The se- 
quences of primers used were as follow: HDACl, forward 
5 - CAATGAAGCCTCACCGAATC-3', reverse 5 - CTC 
CTCAGCATTGGCTTTGT-3'; CBP, forward 5 - GTTTC 
CCCGCAAATGACTG-3', reverse 5 - CTGCCCTCCAG 
CTTGACTAA-3'; p300, forward 5 - CATGAATCCAGG 
GCCTAACA-3', reverse 5 - CGGATCATACTTGGGTC 
AGG-3'. 

Immunohistochemistry 

Staining was performed in 4 \im formalin-fixed, paraffin- 
embedded tissue sections of CRCs and matched normal 
mucosa from 10 patients (Additional file 1: Table S5) 
with the use of Envision Detection System (DAKO). Sec- 
tions were deparaffinized with xylene and rehydrated in 
a series of decreasing concentration of ethanol solutions. 
Heat-induced epitope retrieval was carried out in Target 
Retrieval Solution (pH 6) (DAKO) in a 96°C water bath, 
for 20 minutes. After cooling retrieval solutions for 
25 minutes at room temperature, the slides were treated 
for 5 minutes with Blocker of Endogenous Peroxidase 
(DAKO). Slides were incubated with anti-H3K27Ac 
(ab4729. Abeam) (diluted 1:500) for 30 minutes at room 
temperature and subsequently labeled with the Envision 
Detection System (DAKO). Color reaction product was 
developed with 3,3'-diaminobenzidine, tetrahydrochlo- 
ride (DAB) (DAKO) as a substrate, and nuclear contrast 
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was achieved with hematoxylin counterstaining. Represen- 
tative pictures of each sample were taken at magnification 
x400 and used for the automatic calculation of the per- 
centage of positively stained nuclear area (labeling index). 
For this purpose ImmunoRatio i.e. softwere for automated 
image analysis [37] was used and the results for normal 
samples and CRC sections were compared. 

LC-MS settings 

LC-MS analysis of histone peptides was performed used 
a LTQ-Orbitrap Velos mass spectrometer (Thermo Scien- 
tific) coupled with a nanoAcquity (Waters Corporation) 
LC system. Spectrometer parameters were as follows: po- 
larity mode, positive; capillary voltage, 1.5 kV. A sample 
was first applied to the nanoACQUITY UPLC Trapping 
Column (Waters) using water containing 0.1% formic acid 
as the mobile phase. Next, the peptide mixture was trans- 
ferred to the nanoACQUITY UPLC BEH C18 Column 
(Waters, 75 [im inner diameter; 250 mm long) and an 
ACN gradient (5-40% over 100 min) was applied in the 
presence of 0.1% formic acid with a flow rate of 250 
nl/min and eluted directly to the ion source of the mass 
spectrometer. Each LC run was preceded by a blank run 
to avoid sample carry-over between the analyses. 

Qualitative LC-MS/MS analyses were performed on 
pooled samples in data-dependent acquisition mode. Up 
to 5 MS/MS processes were allowed for each MS scan, 
and high-energy collision dissociation (HCD) was used 
for peptide fragmentation. Quantitative analyses of indi- 
vidual samples were performed by using separate survey 
scan LC-MS runs with a m/z measurement range of 
300-2,000 and the same ACN gradient settings as those 
used for the LC-MS/MS runs. The data-dependent MS- 
to-MS/MS switch was disabled, and the spectrometer 
resolution was set to 15,000. 

Qualitative MS data processing and database search 

The acquired MS/MS raw data files were preprocessed 
with Mascot Distiller (version 2.2.1, Matrix Science), and 
the resulting peak lists were searched against the Homo 
sapiens entries of the SwissProt database (version 
05.10.2012, 20306 sequences) using Mascot (version 
2.2.03, Matrix Science). The search parameters were as 
follows: enzyme specificity: semitrypsin; maximum num- 
ber of missed cleavages: 2; protein mass: unrestricted; par- 
ent ions mass error tolerance: 5 ppm; fragment ions mass 
error tolerance: 0.02 Da; fixed modifications: Carbamido- 
methylation (C); and variable modifications: Acetyl (K) 
(42.010565 Da), Methyl (K) (14.015650 Da), Dimethyl (K) 
(28.031300 Da), Trimethyl (K) (42.046950 Da), Methyl (R) 
(14.015650 Da), Dimethyl (R) (28.031300 Da), Deamidated 
(R) (0.984016 Da), Phospho (ST) (79.966331 Da) and Oxi- 
dation (M) (15.994915 Da). 



The statistical significance of peptide identifications 
was determined using a target/decoy database search ap- 
proach and a previously described procedure that pro- 
vided ^-value estimates for each peptide spectrum 
match (PSMs) in the data set [38,39]. Only PSMs with 
^-values < 0.01 were regarded as confidently identified. 

Additional acceptance criteria were used for assessing 
confidence of modified peptides. In the first step, the 
exact position of the modifications in the sequence was 
established by an adopted version of the phosphoRS al- 
gorithm [40]. Next, the MS/MS spectra were inspected 
manually for accurate fragment ions assignment. Finally, 
selected types of sites were rejected as potential experi- 
mental artifacts. Those included: lysine methylations on 
the C-terminus of the sequence or detected in peptides 
with acidic residues (possible artifacts of methyl esterifi- 
cation of the carboxylic group) and peptides with deami- 
dation on the C-terminal arginine (tryptic cleavage after 
a deamidated residue have been recently shown as a 
highly unlikely event [41]). 

Proteins represented by less than two peptides, or 
identified by a subset of peptides from another protein, 
were excluded from further analysis. Proteins matching 
the same set of peptides were grouped together into 
clusters. All the steps involved in Mascot results pro- 
cessing were performed using MScan, a proprietary Java 
application available at http://proteom.ibb.waw.pl/mscan. 
Multiple alignment of protein sequences and visualization 
of the detected post-translational modification sites were 
performed using CLC Sequence Viewer (CLC Bio). 

Quantitative MS data processing 

Peptides identified in all LC-MS/MS runs were merged 
into a common list, which was next overlaid onto 2-D 
maps generated from the LC-MS profile data of individ- 
ual samples. The feature extraction procedure was de- 
scribed in detail in a previous study [42]. Briefly, the list 
of identified peptides was used to tag the corresponding 
peptide-related ion spectra based on m/z differences, de- 
viations from the predicted elution times, and the match 
between the theoretical and observed isotopic envelopes. 
The maximum deviation accepted in m/z and the retention 
time were established separately for each of the processed 
LC-MS spectra to account for possible variations in mass 
measurement accuracy and chromatographic separation 
between runs. First, an initial search with wide tolerance 
and restrictive parameters of isotopic envelope fits was per- 
formed. Next, nonlinear mass and time calibration func- 
tions were calculated using LOESS regression, and the 
search was repeated with narrowed tolerances and relaxed 
fit requirements. Finally, relative abundances of peptide 
ions were determined as the heights of 2-D fits to the 
most prominent peaks of the tagged isotopic envelopes. 
For normalisation purposes, the calculated abundance of 
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each peptide was divided by the median abundance of all 
the peptides detected in the sample. 

Given the normalised peptide abundances, quantitative 
values (further referred to as "modification intensities") 
were calculated for distinct post-translational modifica- 
tion types observed on each of the previously identified 
sites. These values were computed using a procedure 
that involved rescaling of the abundances of single- 
modified peptides covering the site of interest to a com- 
mon level, followed by computing their median value. 

Statistical analysis of quantitative MS measurements 

A non-parametric resampling-based test with paired t statis- 
tics was used to evaluate the differences in site-modification 
intensities between the two groups of samples. Modification 
sites with ^-values < 0.05 were considered as significantly 
changed. 

Additional files 



Additional file 1: Table 51. The list of peptides identified in MS/MS 
analyses with estimated FDR = 0.01 . Table S2. The list of identified 
proteins together with their MS/MS-related details. SCORE - MascotScore; 
PEPT# - number of peptides assigned to protein. Proteins matching to 
the same sets of peptides were grouped into unique clusters and 
presented as single rows of the table. Table S3. List of identified core 
histone-derived peptides. Table S4. List of identified postranslational 
modification sites core histone proteins. Site-modification combinations 
not present in PhopshoSitePlus and/or Histome databases are marked in 
red. Table S5. Clinical characteristics and histopathology of the tissue 
samples used in a study. 

Additional file 2: Figure SI. An example MS/MS spectrum of the 
peptide K(Ac)SAPATGGVK derived from the H3 histone proteins family. 
The amino acid sequence of the peptide includes the lizyne K27 residue. 
The plot was generated using the ExpertSystemGui application available 
at (http://www.biochem.mpg.de/mann/tools/). Figure 52. 
Immunohistochemical staining of 10 matched normal and CRC tissue 
sections with use of antobody against H3K27Ac. Magnifications lOOX and 
400X. Figure S3. H3K27Ac mark level in quiescent and proliferating CRC 
cell lines. Cells cultured for 24 h with 10% or 0.5% FBS were harvested, 
histones isolated by acidic extraction and then 5 |jg of protein was 
resolved by SDS-PAGE and electrotransferred to PVDF membrane. Blotted 
proteins were assessed by Western blot analysis using the antibodies to 
H3 (ab1791) or H3K27Ac (ab4729). Densitometric measurements were 
performed using OptiQuant image analysis software. H3K27Ac level was 
normalised to the signal from total H3 and presented of the chart. 
Figure S4. Analysis of mRNA expression levels of CBP, p300 and HDACl 
in individual tissue samples of 26 adenocarcinomas (CRC) and 24 healthy 
mucosa's (NC). One microgram of total RNA was reverse-transcribed 
to generate cDNA and then qPCR was performed using SYBR Green I 
chemistry. Green horizontal bars indicate means and red whiskers 
indicate standard deviation. Differences were analyzed using the 
Mann-Whitney test. 
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